DOCOHENT RESUME 



ED 095 221 



95 



TH 003 907 



AUTHOR 
TITLE 



INSTITUTION 

SPONS AGENCY 

PUB DATE 
CONTRACT 
NOTE 



EDRS PRICE 
DESCRIPTORS 



Wunderlich, Kenneth W,; Borich, Gary D, 
Curvilinear Extensions to Johnson-Neyman Regions of 
Significance and Some Applications to Educational 
Research. 

Texas Univ., Austin. Research and Development Center 
for Teacher Education. 

National Inst, of Education (DHEW) , Washington, D.C.; 
Office of Education (DHEW) , Washington, D.C. 
[Apr 74] 

NE-C-00-3-0066; OEC-6-10-108 

26p. ; Paper presented at the Annual Meeting of the 
American Educational Research Association (59th, 
Chicago, Illinois, April 1974) 

MF-$0.75 HC-$1.85 PLUS POSTAGE 

Computer Programs; Educational Research; Hypothesis 
Testing; Mathematical Models; *Multiple Regression 
Analysis; *Statistical Analysis; *Tests of 
Significance 



ABSTRACT 

Considerable thought, research, and conce 
expanded in an effort to determine whether the assumption 
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violated the assumptions which Johnson and Neyman (1936) 
calculating regions of significance about interacting reg 
particular, there has been special concern for the assump 
linearity. Debate has ranged from whether linearity refer 
functional relation of the criterion and predictor to whe 
in the context of a linear statistical model, if not both 
extends the Johnson-Neyman technique to the curvilinear c 
illustrates this extension by reanalyzing data from two p 
published research studies which have considered only a 1 
relationship between a covariable criterion. A computer p. 
included in the appendix. (Author/RC) 
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Curvilinear Extensions to Johns on-Neyman Regions of Significance 
and Some Applications to Educational Research 

Kenneth W. Wimderlich and Gary D. Borich 
The University of Texas 

Consider an experiment in which there are two treatments, one criterion 
and one covariable, and that the researcher would like to detect a co- 
variable by treatment interaction. In order to detect this interaction, 
the researcher can (1) divide the covariable into any number of discrete 
categories (i.e., blocks) and perform a Treatment x Blocks analysis of 
variance (Edwards, 1968) or (2) leave the covariable a continuous measure 
and test for the homogeneity of group regressions (Walker and Lev, 1953) . 
For the homogeneity of group regressions analysis the researcher employs 
each covariable value as a discrete unit of measurement and thereby avoids 
losing information by assigning different covariable values to the same 
block. 

Cronbach and Snow (1973) have shown that for the case in which there 
is a moderately strong interaction, the statistical power of the homo- 
geneity of group regressions test is superior to blocking at the median, 
blocking at the 33rd and 67th percentiles, or similar configurations that 
may be employed in a Treatment x Blocks design. Classification schemes 
such as these discard power by treating dissimilar data the same, causing 
the risk of accepting a false null hypothesis to increase beyond that 
which can be expected when the homogeneity of^ group regressions test is 
applied. 

When group regressions are heterogeneous, i.e., E^ chooses not to 
employ a factorial design, Johnson and Neyman's (1936) procedure can be 
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used to determine the region(s) of covariable values for which treatments 
are significantly different. These regions allow covariable scores to be 
used to determine the treatment for which each is best suited or if, for 
any given S_, there is no best treatment. This technique is used generally 
in the study of aptitude-treatment or trait-treatment interactions and is 
considered the preferred methodology for such studies (Berliner & Cahen, 
1972; Cronbach & Snow, 1973). 

Considerable thought has been expended in an effort to determine 
whether the assumption of a quadratic relation between a single predictor 
and a criterion violates the assumptions which Johnson and Neyman (1936) 
state for calculating regions of significance about interacting regres- 
sions. In particular, there has been special concern for the assumption 
of linearity. One may ask whether "linearity" refers to the functional 
relation of the criterion and predictor or whether it is in the context of 
a linear statistical model, if not both. The computational procedure, 
however, reveals no need for the former assumption of linearity, thereby 
providing the foundation for extending the Johnson^Neyman procedure for 
determining regions of significance to the curvilinear case. 

Johnson and Neyman (1936, p.. 73) in discussing the difference, 6, 
between the regression equations for two experimental groups as 0(x,y) = 
fj_(x,y) - f^(x,y) where f^^ and f^ are functions of predictor variables 
X and y, comment: 

Tie functions f^ and f^ may be chosen arbitrarily, according to the 
conditions of the particular problem, with the only theoretical re-, 
striction that both functions f and f2 must be linear with regard 

to the unknown constants they involve. Thus we could assume that, 

e.g., fj^(x,y) = + A^cosxy + A^e^"^^ but our method would fail if 

y 

it were necessary to assume f ^^(xy) = cosA^x + since here the 
dependence upon A^^ and A^ is not linear. 
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Most crucial is the position of the coefficient of the variable term, 

cosxy. When placed before the variable term as a multiplicative factor, 

this term makes the equation consistent with what is known mathematically 

as a linear equation. The fitting of curvilinear regression equations 

2 P 

defined by polynomials of the form Y = b + b x + b x + . . . + b x is 

Q 1 2 p 

in principle no different from the fitting of multiple regression equa- 
tions. 

Developmen t of Curvilinear Formulae 

Group Regressions 

Given groups 1 and 2 with regression equations of the form Y - 

? ^ 2 

A-j^ + B^X + C^X and ^2 ^2^ ^2^ where the subscripts 1 and 2 

refer to groups 1 and 2 respectively, the first consideration in the 
series of hypotheses related to the Johns on-Neyman procedure is that of 
homogeneity of criterion variance. The second consideration is that of 
heterogeneity of regression. Johnson and Jackson (1956), in their two- 
predictor case, provide the formula for computing the variance of the 
actual scores about the predicted scores and the variance of the observed 
scores about a regression line with a common slope in order to compute an 
F-ratio for homogeneity of group regressions. If the probability level 
associated with this F-ratio is found to be significant, then the Johnson- 
Neyman test for regions of significance is recommended. 

The points at which the regression curves intersect, indicating the 
points of no difference, can also be determined. The computational forms 
to be employed in this series of steps is as follows • 
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Regions of Significance 

f Tae difference in regression curves is expressed as (A^^ - A^) + 

(d. - B^)X + (C - C )X^ * D. Setting D « 0 yields a quadratic equation 

o 

(A. - A ) + (B - B )X 4- (C. - C )X = 0 which can be solved. Application 
1 2 12 i 2 

of the quadratic formula, - B ± "/ - 4AC , yields two zeroes assuming the 

2A 

term B - 4 AC is non-negative. If this term were negative, there would be 
no point at which the two curves intersect. Such an occurrence is feasible 
as shown in Figure 1. 



Insert Figures 1, 2 and 3 



If - 4AC = 0, then Figure 2 is most likely and, if B^ - 4AC > 0, 
then Figure 3 is likely. 

To determine regions of significance there are several possibilities 
based on which of the situations outlined above is considered. Given 
Figure 1 or 2 one would most likely find a left and right boundary for the 
region of non-significance as indicated by the shaded portion. To deter- 
mine these points along the X-axis, consider the formula: 



where S ^ is the best estimate of the error variance obtained by pooling 
the variances about the group regression lines and ^ ^ - ^ is the variance 
for the difference in regression lines where D « (A^^ - A^) + 
(Bj^ ~ B^)X + (Cj^ - C^)X^ and where P + Q is a "scaling" factor used to 
determine the variance for the difference between the regression equations 
for the two groups. The value of (P + Q) depends upon the values of the 
basic characteristics of matching the x's and y's. Where x' and y' lie 
near the population means of x and y, the value of (P + Q) becomes small 
since (P + Q) becomes ^ + when x = x' « and y = y' = y^. is of 



the general form Ax'^ + BX-^ + CX^ + DX + E so that the equation 

/ D^\/^s/\ |(P + Q)S,^] 
(?-rq)/{W^)= F.05<l.N-6) expressible as - F^Qg = 0 is a 

fourth order (quartic) equation. This is consistent with the illustrations 
presented earlier. If four distinct solutions exist then two regions of 
non-significance are defined as indicated in Figure 3. To find the solu- 
tions to this equation, a method suggested by Standar d Mathematical Tables 
(1965) is available which requires finding a solution of a resolvent cubic 
equation first but which in turn can be used to obtain all four roots of 
the above fourth order equation. The following formulae are used in 
finding these values. 

4 3 Z 1 
Given the quartic equation (1) x + ax +bx +cx + d = 0,a form 

2 ^ 

which the equation obtained from D can assume, a resolvent cubic equation 

of the form (2) y"^ - by^ + (ac - 4d)y - a^d + 4bd - c^ « 0 is obtained. 

A root of this equation is obtained by reducing equation (2) to the form 
3 

(3) z + fz + g = 0; the "second" order term is eliminated by substituting 
(z + \) for y in (2) where f ^t3(ac - 4d) - b^] and 



g ~[2b^ - 9b (ac - 4d) + 27(-a2d + 4bd - c^)| . Let A =^z£,+^^+ ^ 
J 2 fA 27 

=yr - b + M. 

D rjl^ r2 - 2b + - ^^^"^ and E - - 



and B =^^-g "Vg^ . f_ ; then A + B = M is a root of equation (2) from 
J 2 "/4 27 

which the roots for (1) can be obtained. Consider R - b + M, if R ^ 0, 



4ab - 8c -a^ 

- ■ , 4R 

Otherwise D - 2b + 2^ - 4^ and E - 2b - zjli^ - 4^ . The four 

roots of the fourth order equation are then x • ^ + |: ± ^ and x - ^ - ~ ± ^ . 

4 2 2 4 2 2 

These roots are real dependent upon the arguments for the square roots above 
being greater than or equal to zero. 
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Research Applications 

The above computations may be simplified through the use of a computer 
program (Wunderlich and Borich, 1973) which, after solving the fourth order 
equation for determining regions of significance, plots the data points, 
within-group regressions and regions of significance for the case in which 
there are two treatments, one criterion and one predictor. Before applying 
the program, however, the investigator must consider the full range of 
calculations, both linear and nonlinear, that may be required by any given 
data set. These calculations are diagrammed sequentially in Figure 4. 

Insert Figure 4 about here 

Note that the sequence begins by testing the assumption of curvilinear ity 
and that criterion-covariable relationships may be curvilinear for groups 
separately or combined. In one case the combined groups may exhibit a 
single underlying curvilinear relationship such that the two groups are 
samples from the same population, i.e., regression slopes for the groups 
are homogeneous. In a second case the relations between the covariable 
and criterion may be curvilinear for each group separately, i.e.,, regres- 
sion slopes for the groups are heterogeneous. Only for the latter case 
are regions of significance tenable. Whereas the foregoing discussion 
addresses both homogeneity of curvilinear regressions and regions of 
significance, it does not cover a third opportunity to detect a curvilinear 
relationship. In a third case regressions that are both linear and non- 
intersecting (ordinal) may mask an underlying curvilinear relationship 
in which linear slopes of both groups form a single curvilinear trend. 
The continuity of separate linear slopes can take the form of a single 
curvilinear regression that is easily missed when the efficiency of linear 
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and curvilinear models are not compared at the outset of the research. 
This initial comparison and the sequence of steps representing the left 
portion of Figure 4 are calculated by the curvilinear program. When 
one or both within-group regressions are linear, i.e., a curvilinear 
model fails to significantly improve prediction, the researcher is referred 
to a linear program by Borich (1971) for the cas,-^ in which there is one 
predictor or to a linear program by Borich and Wunderlich (1973) for the 
case in which there are two predictors. 

To illustrate the above computations, a research study conducted 
by Hughes and published in the American Educational Research Journal (1973, 
10, 21-37) was reanalyzed with the Wunderlich and Borich (1973) curvilinear 
program. 

Background . Hughes set out to establish that differences in residual post- 
test achievement are a function of pupils responding to teacher^' questions 
(a) randomly, (b) systematically and (c) in a self-selected style in 
which the pupil has the choice of whether or not to respond. With three 
schools available for experimentation, E chose a 3 x 3 factorial design 
with schools as the first factor and type of responding as the second. 
Criterion performance was established with a 222-item posttest based upon 
the content of the treatment, a wildlife lesson directed to seventh- and 
eighth-grade pupils. In addition, collected scores on nine covariables 
with the foresight that these may confound posttest performance. Among 
these E included the most likely predictor — that of pretest achievement 
on the criterion instrument. E chose to remove posttest variance attrib- 
utable to differences on the pretest by using as the criterion the differ- 
ence between predicted and actual posttest scores (i.e., the residuals). 
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Hughes' data provided the opportunity to uncover nin^ aptitude- 
treatment interactions. For our illustration, however, the reanalysis was 
limited to one of the nine covariables, that of the pretest and to only 
one pair of treatments, that involving the self-selected and systematic 
resporse groups. 

Results . These data, while previously reanalyzed by :4orich (1974) 
with the assumption that covariable-criterion relationships were linear, 
were reanalyzed a second time assuming that relationships between covariable 
and criterion were curvilinear. Application of the curvilinear program 
to these data, however, revealed no significant improvement in prediction 
from that obtained from a linear model for either the systematic response 
or the self-seiecced response groups and therefore the investigator was 
referred to the more suitable linear program (Borich, 1971). In the 
interest of providing a real data illustration of the curvilinear tech- 
nique, the program then was forced subsequent to the model comparisons 
test to continue as though curvilinear regressions had been found in one 
or more treatments. The program, therefore, calculated the homogeneity 
of group regressions test for curvilinear regressions, the test for a 
common intercept and regions of significance rather than "exit" as would 
be expected after the model comparison test for lack of a significant 
improvement in predictive efficiency with the curvilinear model. The 
resulting plot from the curvilinear analysis of these data is reported 
in Figure 5 and the resulting plot from the corresponding linear analysis 
by Borich (1974) is reported in Figure 6. 



Insert Figures 5 and 6 about here 
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Note that for both analyses a region of significance is indicated 
to the left of the data mass. While the region of significance for the 
curvilinear plot begins at the covariable value of 14.20 (taken from 
the printout) and the region of significance for the linear plot at the 
covariable value of 13.^^0, both regions lie in approximately the same 
area and encompass approximately the same range of covariable values. 
Tliese ranges represent those covariable values for which the treatments 
are sigiiif icantly different at or beyond the «05 level. For this example 
the curvilinear plot lends little new infomation to the study and, if 
employed, might needlessly complicate its interpretation. Using the 
linear plot, we inight conclude that those subjects scoring below 13.20 on 
the pretest should receive the self-selected response treatment while those 
falling above this value are likely to / Ic /e equally well from either 
treatment and therefore should receive ' \.3 least costly of the two treat- 
m^^'-n t s . 

Other configurations . Lest the reader receive the impression from 
this example that the present data represent the only configuration of 
curvilinear regressions and corresponding regions of significance, other 
hypothetical distributions have been analyzed and plotted and are reported 
in an appendix to this report. A listing of the program used to generate 
these plots is also provided. 
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Figure Captions 

2 

Figure 1. B - 4AC < 0. 
2 

Figure 2. B - 4AC = 0. 
Figure 3. B^ - 4AC > 0. 

Figure 4. Sequence of linear and curvilinear calculations for ATI analyses 

Figure 5. Region of significance (< 14,21) and nonsignif icance (> 14.20) 
for Hughes (curvilinear analysis). 

Figure 6. Region of significance (< 13.21) and nonsignif icance (> 13.20) 
for Hughes (linear analysis). 



Appendix 




a. A positive linear regression 
and a moderately strong curvilinear 
regression with small left and 
right regions of significance 



b. Two slightly curvilinear positive 
i^egressions with a moderately sized 
left and large right region of 
significance 




c. Two moderately strong curvi- 
linear regressions in opposite 
directions with a small left and a 
moderately sized right region of 
significance 



d. A slightly curvilinear ret,ression 
and a moderately strong curvilinear 
regression in opposite directions with 
a small left and a large right region 
of significance 



PROGRAM CUkVAT I (iN^uTi OUTPUT, PLOT) 

C (P«£LI.'^INARY VERSION, a/5/7i4) 

C CONTROL CARDS A\0 TrtE DaTA ARE ARRANCiED IN THE FOLLOWING OROtK 

C FIRST CfkROt COLS, 1«5 = NUMBER OP SUBJECTS IN GROUP I 

C COLS, 6«ia a NUMBER OF SUBJECTS IN GROUP 2 

C COLS, IS s MISSING DATA OPTION 

C IF lEHOt ALL DATA IS INCLUDED IN THE ANALYSIS 

C IF ONfc, 9LANKS ARE NOT INCLUDED BUT ZEROES ARE 

C IF TWO, BLANKS AND ZEROES ARE NOT PROCESSED 

t COLS, i6«ee ■ ALPHA CP) level used to oecioE 

C WHETHER A LINEAR OR CURVILINEAR MODEL IS APPROPRIATE 

C SECOND CAROi COLS, 1-5 • F-VALUE ASfOCIATeO WITH P, A80V£(D«f»l AND N»-3) 

C THIRD card: GROUP 1 FORMAT (PREDICTOR MUST PRECEDE CRITERION) 

C INSERT GROUP I DATA HERE 

C FOURTH CARD: GROUP 2 FORMAT 

C INSERT GROUP 2 DATA HERE 

C PROGRAMMED BY KEN W wUNOERLICH AND GARY D BORICH, RESEARCH AND DEVELOPMENT 

C CENTER FOR TEACHER EDUCATION, THE UNIVERSITY OF TEXAS AT AUSTIN 

DIMENSION Yl (23*J) ,Xi C2?0),X12(23B),y2(23;3),){2C230),X2i?C23B), Y(a50) 
l,XC4b«) /X02(ab0) /FRMTl Ca),FRMT2(8),A(40a),6(a003 ,C ( 4^0 ) , NSC < I B ) , YP 
2M(1^) ,XXCa),PXCa),XPTLNl C35),YPTLNl{35),XPTLN2(35),YPTLN2(35),XPT( 
SSS) , YPT (35), 

4QXl(4),QYl(a),QX2(4),QY2(4),QX3(4),QY3(4),QX4(4),QYa(<(),AA(5),Z(4) 
COMPLEX I 

READ 1,N1,N2,HISS,P8LV 

1 FORMAT (2I5,4X,n,F5,3) 
READ 27,FSL 

27 FORMAT (F6,3) 

READ 2, (FRMTl (I) ,lawe) 

2 FORMAT (SAIB) 

READ FRMTI,(Xl(I),Yl(n,I»l,Nl) 
CALL AMISDAT ( XI , Y I , Nl , M|SS) 
00 3 KsifNl 

3 Xl2(K)BXi(K)*Xl (K) 

CALL SUMDATA(Yl,Xl,X12,yiM,XlM,X12M,YlSD?XlS0»Xl28D» YISS,X1S8,X12S 
lS,»YXl,RYX12,RXX12«RSSYXl,RSSYX12,RS8XXU,Nt) 
READ 2, (FRMT2(I),lsi,g) 
READ FRMT2, (X2(I) ,Y2(I),I«1,N2) 
CALL AMISDAT (X2,Y2,Na, MISS) 
DO 4 K3i,N2 

4 X22(K)=X2(K)*X2(K) 

CALL 3UMDATA ( Y2, X2, X22, Y2M, X2M , X22M, Y2SD, X28D, X22SD, Y28S, X28S # X22 
ISS,RYX2,RYX22,RXX22,RSSYX2,R3SYX22,RSSXX22,N2) 

N=NUN2 

DO b K=1,N1 

X(K)=X1 (K) 

Y(K)SX2(K) 
b X02(K)sX12(K) 

DO 6 Ksl,N2 

X(N1+K)=X2CK) 

Y(Nl*K)aY2(K) 
b X02(Nl*K)sX22(K) 

CALL SUMOATA ( Y,X,X02, YM,XM,X02H, YSD,XSD,xe2SD,YS8,XS3,X02S8»RYX,RY 
lX()2,RXX02,RSSYXf RSSYX02,R8SXX02,N) 

PRINT 7 
7 FORMAT (IHU 

PRINT e,Nl ,N2,N,YlM,Y2H,YM,Yi8D, Y2SD, YS0rXlH,X2M,XH,XlSD,X2SD,XSD, 
1RYX1,RVX2,RYX 

a F0RMAT(28X,*SUMMARY STATISTICS*//, 29X,*CR0UP l*8X*GR0UP 2*9X*T0TAL 
O l*//,bX*GROUP N*20XrI4, 11X,I4, 10X,I4,//,5X*CRXTERION MEAN*9X,F8, 3 , 7 
ERIC iX,Fa,3,6X,F8,3,//,5X,*CRIT£RI0N ST, DEV , *5X,P8, 3, 7X , F6 , J , 6X,F6 , 3 ,/ 
"^3/#5X,*PHEDICT0R MEAN*9X,F8,3,7X,F6,3»6X|P«, J»//»5X,*PRE0XCT0R ST, 



aD!-'V,A5X,F6,"i,7XpFa,3#6X#F8,3#//»5X,*CQRRELATI0N Y AND X*6X»F6,4, 

AN=N 

ANl SNI 
ANlOFaANl*3,0 

AN2aN2 

AN2OF»ANa»3,0 
C MODEL COMPARISON 

RS01»(RYXl*R\rXl ♦RYXl2*fiYXl2«2,0*RYXl*Ryxl2*RXXl2)/(l,0«RXXU*RXXl2 

n 

RSQ2a(RYX2*RYX2*RYX22»HYX22»2,0*RYX2*RYX22*HXX22)/( 1 ,0*RXX22*RXX22 

n 

RlsRYXl*RYXl 
R2aRYX2*RVX2 

VAYXXlsYlSO*YlSO*Cl,0-RSQl) 
VAyxX2=Y2SD*Y2SD*(l,0-RSQ2) 
VAYXl =YiSD*VlSD*(l,0»RYXl*RYXl) 
VAVX2sYiJSD*Y2SD*(l ♦0«»RYX2*RYX2) 
FGPls (^^SCilHRD/Cd.B-RSQD/ANlDF) 
FGP2S <RSU2-R2}/C Ci,0«RSQ2)/AN2DF) 
ANOf l = M-!f2,0 
ANOF2c(v2«2,0 

SLMOOi=PRBFC 1 ,0, ANlDFiFGPl) 
SLM0D2 = PRBf^ Cl,0rAN2DFfFGP2) 
IF(SUMa0iae,P8LV.0R,SLM0O2,LE,PBLVJl7,i6 
lb PRINT 18#P8LV 

Id FORMAT C/#iX*A COMPARISON OF A LINEAR AND QUADRATIC MODEL FOR tACH 
lOF GROUPS 1 AND 2 SUGGESTS*/ /|5X , *N0 SIGNIFICANT DIFFERENCE AT THE 
2 *F'i,i* LEVEL AND THUS USE OF A LINEAR MODEL AND */ , 5X , * ANAL Y5I8 I 
3S SUGGESTED, SEE BORICHf EDUCATIONAL AND PSYCHOLOGICAL MEASUREMEN 
4T,*,/,5X,*l971,3l,P25l*2S3,*) 
GO TO 101 

17 BXla(YlSD/XlSD)*(RYXl«RYXl2*RXX12)/(l,0«RXXl2*RXX12) 
BX21s(YlSD/X12SD)*{RYX12»RYXl*RXXl2)/(l,0«RXX12*RXXl2) 
A01 = YlMf.8Xl*XlM»8X21*Xl2M 

5X2a(Y<iSD/X2SD)*CRYX2«»RYX22*RXX22)/(l,0*RXX22*RXX22) 
8X22a(Y2SD/X22SO)*(RYX22«RYX2*RXX22)/(l,0»RKX22*RXX22) 
A02sY2M-8X2*X2M-SX22*X22M 
PRINT I2f A0l,BXl,BX21 ,A02,8X2,BX22 
12 FORMAT (////, 5X,*THE REGRESSION EQUATION FOR GROUP 1 IS Y a *F6,« 
1* * *F8,a* X + *F8,a* XX,* //,5X,*THE REGRESSION EQUATION FO 
2ft GROUP 2 IS Y = *F6,«* ♦ *Fe,a* X ♦ *F8,«* XX,*) 
PTlsPT2=0,0 
ERR0ROVAYXXHVAYXX2 
SYYwsYlSS*Y2SS 
SXXWSX1SS+X2SS 
SXX2WSX12SS+X22SS 
RYXSSW=RSSYX1*R3SYX2 
RYX2SSW3R5SYXl2fRSSYX22 
RXX2SSwsRSSXX12^R$SXX22 

VAKE6RSaSYYW*( (RYXSSM**2*SXX2NtSXXM*RYX2S9M**2"2,e*RXX2SSW*RYX2SSW 
1*RYXSSMJ/ (SXXW*8XX2N»RXX2SStN**2} ) 
ANOo ANDF/2,0 

FHEGRES sANOAtVAHeGRS-'ERROR) /ERROR 
PREGRESapRBFC2,0,ANDF,FREGRES) 
PRINT 14,FREGRES,PREGRES 
l« FORMAT(//#53X*THE F-RATIO FOR A TEST OF HOMOGENIETY OF REGRESSION I 

IS *F6»3,/#5X»*wlTH AN ASSOCIATED PROBABILITY LEVEL OF *F8,3) 
O VINTER = YSD*YSD*( CI ,0''«YX*RYX-RYX02*RYX02»RXX02*RXX02 + 2,a*RYX*RYX02 
El\IC l*WXXi}2)/(U0«RXX02*RXX02) ) 
AN0»N»3 



f-lNT£Rs( AKOF/3,l3)*(ViNTER-tkROR)/eRH0R 
PINTERS PRbFCi,M, ANUr pFINTER) 
PRINT iSf f INTER, PINTER 
15 FORMAT (//,bX,*THE HYPOTHESIS OF COMMON REGRESSION CONSTANTS IS TES 
ITEO BY AN F-RATIO OF *F8,3/5X,* WHICH HAS AN ASSOCIATED PR08ABJLIT 
2Y LEVEL OF *F8,3) 
AD = A0>Ae2 
BX0ai8Xl«BX2 
BX20SBX21-BK22 
QUA09;BXD*BXD«'4,id*BX2D*AD 
IFCQUAO.GE.0,0)20f 19 

19 PRINT 21 

21 F0RMAT(//,5X,*THE REGRESSION CURVES FOR GROUPS 1 AND 2 DO NOT INTE 
IRSECT,*) 

GO TO 22 

20 IF(QUAO, £0.0,0)23,24 

23 PTla •8X0/(2,0*8X20) 
PRINT 25,PTi 

25 F0RMATC//,5X,*THE REGRESSION CURVES iNTERStCT AT A POINT WHERE X I 
IS EaUAL TO * F6,3) 

GO TO 22 

24 PTi= (SQRT(QUAO)-BXD}/ (2,0*8X20) 
PT2=(»BXD-SQRT (QUAD) )/C2,0*BX2D) 
PRINT 2b,PTl,PT2 

2b FORMAT (//,SX,*Tnt REGRESSION CURVES INTERSECT AT THE POINTS KHERE* 
1/*X IS EQUAL TO *F6,3* AND WHERE X IS EQUAL TO *F6,3) 

22 Rlal,0/( 1,0-RXX12*RXX12) 
K2ai,0/(1 ,0-RXX22*RXX22) 
Sls2,0*RXXl2/SQRT(XlSS*X12SS) 
S2*2,0*RXX22/SQRT(X2SS*X22SS) 
FACTOR»f SL*ERROR/ANDF 

X4C0EF a8X2D*BX2Dt FACTOW*(RJ / X12S8 ♦ R2/X2288) 
X5C0EF s2,0*BXD*BXO2D ♦FACTORAt Rl*Sl ♦ R2*S2) 

X2C0EFF aBXD*BXD*2,0*AO*BX2D-FACTOR*( Rl/XlSS ♦ R2/X2SS • Rl*Si*Xl 
IM • R2*S2*X2M m 2,0*Xi2M*Rl/Xl2SS - 2 , 0*X«2M*R2/X22SS) 

XICOEF b2,0*aD*BXO •FACTOR* CRl*Sl*X12H ♦Ha«S2*X22M*2,0*Rl*XlM/XlSS 
1*2,0*R2*X2M/X2SS) 

CONSTA= AD*AO «• FACTOR *( 1,0/AM +1,0/AN2 ♦X1M*XIM*RI/X1SS + R2* 
1X2M*X2H/X2SS Rl*Si*XlM*X12M • R2*S2ftX2M*X22M ♦ R 1*X12M*X12M/X 12S 
2S + R2*X22H*X22M/X22S3) 

AA(l)8 xacoEF 

AA{2)= X3C0EF 

AA(3)s X2C0tF 

AAC4)s XICOEF 

AAC5)»: CONSTA 

CALL ZPOLYR (AA,a,Z,Z) 

PRINT 55, Z 
55 FORMAT (8(2X,Fa,a)) 

NSTl=NST2=33 

CALL AMINMAX(X,N,XMIN,XMAX) 
PRINT 2000, XM1N,XMAX 
2000 FORMAT (5X,*XMIN s *F8,3* XMAX = *F6,3) 

26 CALL PARAB(BX21,BXl,A0l,XPTLNl,YPTLNl»XMIN>XMAX,AAl,N5Tn 
31 CALL PARAB(BX22,HX2, A02#XPTLN2,YPTLN2,XMXNf XHAX,AA2,NST2) 
33 CALL 86NPLT 

CALL PLTCl, 0,1, 0,-3) 

CALL SCALE (X(n,ia,0,N,n 

CALL SCALE (Yd), 9,0,N,n 

QXl (3)aQX2(3)8QX3( i) =0X4(3) sXPTLNl (NSTl+l)sXPTLN2(NST2*l)BX(N*l) 
O QXl(4)s0X2(4)=QX3(4)aQX4(4)aXPTLNl (NSTl*2)?XPTLN2(NST2*2)sX(N+2) 
ERICQVi(3)aQY2(3)=QY3C3)aQY4C3)»YPTL^Jl (N8Tl + l)!sYPTLN2(NST2*l)sY(N*l) 
"»"0Yl<4)50Y2(*)«QY3{4)aQy4(4)»VPTLNl(NSTl*2)»YPTLN2CN8T2*a)«lYlN*2) 



CALL AXi;j (y,ii5,«3,ar iSHPREOlCTOR V ARI ABLE » I ftp 1 0 , 0# 0 , Br X ( N+ 1 ) * X t K»*2 
1)) 

CALL AXIS (tl,0,«,M,iSHCRn£« ION V AKIABLE» 18»9. 0,90,0, Y (N* I Y ) 

Xl(Nl + l)=xa(N2*l)5X(\' + l) 

Xl(Ni+2)aX2(N2+£)=X(N*2) 

Yl(Nl+l)«Y2(N2+l)sY(N+l) 

YlCNi*2)»Y2CN2*2)aYCN*2) 

00 902 K a \,H{ 

PRINT 9BJ,XICK),Y1(K),K 
9Vil FOKMAT (2XrF6,3,5X,F8,3i2XrI2} 
902 CONTINUE 

PRINT 910 

00 90a K s l,N2 

PRINT 905? X2CK) ,Y2(K) ,K 
9ld5 FO««AT (2X,Fa,i,5X»F8.3,2X,I2) 
90a CONTINUE 

910 FORMAT (5X,*THIS IS GROUP ONE DATA*) 
PRINT 911 

9U FORMATC* THIS IS GROUP TWO DATA*) 

CALL LlN£(Xl,Yl,Nl,lr«'lf 11) 

CALL LINECX2,y2,N2,i ,«l,5) 

PRINT 900, (XPTLN1(J),YPTLN1(J),J=1,NST1) 
900 rORMAT ( 5 ( 2X , f 8 « 3 , 2 X , F8 , 3 ) 3 

PRINT 9i)0, (XPTUN2CJ) »YPTUN2CJ),J=l,NST2) 

CALL LINECXPTLM »YPYI.N1,N3T1,1,0,0) 

CALL L1NE(XPTLN2,YPTLN2,NST2, 1,0,0) 

SXls (XPTLNl (6)'.XCN+l))/X(N+2) 

SYlc (YPTLNi(b)*YCN*l))/Y(N+2) 

SX2= (XPTLN2C6)-XCN+n)/X(N*2) 

8Y2» (YPTLN2(6)-Y(N*i))/Y(N+a) 

IF (AA1,6E,0.0)34,3S 
3a ANU1>90,0 

GO TO 36 

35 ANGls270,0 

36 IF(AA2,QE. 0,0)37, 38 

37 AN62990,0 
GO TO 39 

38 ANG2s270,0 

39 CALL SYM80LCSXl,SYl, ,07,9HGROUP 0NE,ANG1,9) 
CALL 3YMB0LC3X2,SY2, ,07,9HCROUP TW0,ANC2,9) 
TTal0,0*X(N+2)+X{N+l) 

TYs 9,0*Y(N + 2)tYCN*n 
PRINT 912,TT,TY 

912 F0RMATC5X,* TT a *Fa,a* TY ■ *F8,a) 
QXl(l)aQX2(l)aQX3(n»QX<JCl)9X(N*l) 
0Xl(2)aQX2C2)«QX3C2)«!aX4(2)s X(N*n 
0Yl(l)=aY2Cl)8QY3(naQYa(l)sY(N+l) 
OYl(2)=0Y2(2)»QY3(2)aQYaC2)« Y(N+l) 
00 70 JL » Wa 

FX(JL) a 0,0 

IF (AIMAGCZ(JL} ) ,EQ, 0,0) 7U72 
71 PX(JL) a REAL (Z(JL)) 

GO TO 70 
Ji PX(JL) a TT ♦ 10,0 
70 CONTINUE 

PRINT 913* (PX(JL)#JLal#«) 

913 FORMAT (2X,«(2X,F8,3)) 

IF(PX(n ,GE,XCN + 1) ,ANO,PXCl),LE,TT)a0»4l 

(i^ QXl (l)aQXl (2)8PX(l) 
CRir VlCl)aY(N*l) 
£^^iVl(2)»Ty 

41 IF(PX(2),GE,X(N*n,AN0,PX(2),l.E,TT)a2#4S 



a2 QX2(^)=QX2(i)sPX(2) 
QY2(i)=Y<N+i ) 
QY2(2)iT¥ 

Ui IF(PX(3) .6£,X(N*n »AN0,PX{3) stt.TT)aa,45 
4« 0X3(n = QX3(2)5PX(3) 

0V3(l)aY(N*l) 

0Y3(2)aTy 

, as IP(PX(«),6E,X(N+1),AN0,PX(4),LE.TT)«6,47 
46 QX4(l)sQXa(2)vPX(a) 

QY4(2)»TY 
a7 CONTINUE 

PRINT (0X1 (I),QYlCn#I = i,2) 

PRINT 9Ml,tQX2(I),QY2(n#I = l,2) 

PRINT 90i,(QX3Cn,QY3(n»lBl#2) 

PRINT 901, (QXaCDf QYa(I),I = l,2) 
9ldl FORMAT (/,2(bX,F8,3n 

CALL LINE(QXl/OYl,2,l,0,0) 

CALL LIN£(QX2/QY2,2, 1,0,0) 

CALL LINE(QX3,QY3,2, l,(i,0) 

CALL LINE(QX4,QYa,2,l,0,0) 

CALL ENOPLT 
IkSl CONTINUE 

EnO 

SUeKOUTiNE AMISOAT C A , B , NS , M IS DA T A J 

DIMENSION A(2(S0) ,B(2f>0),ILL(200),AAC2d0) ,BBC200 3 

IF (HISOATA,ECi,0)RETURS 

00 750 NQb1,200 
750 ILLCNO)«AACNQ)saB(NQ}s0,0 
DO 200 N«l,Nd 

IF(A(N) .EQ, 0.0, OR, B(N},EQ.0 •0)202, 200 

202 IF(HISOATA,EQ, 2)203, 20a 

203 IJKBlJK^l 
ILL(IJK)»N 
60 TO 200 

20a 1F(,N0T,A(N) ,OR,,NOT,B(N))200,205 

205 IJKsIJK+1 

ILL(IJK)aN 
200 CONTINUE 

IPdLLd), £0,0)222, 211 
2U ICOONTS0 

00 206 JCsl,NS 

00 207 JV=1,IJK 

IF(JC,EQ,ILL(JV))208,207 

207 CONTINUE 
AA(JC»ICOONT)sAtJC) 
8B(JC-ICOONT)sB(.7C) 
GO TO 206 

208 ICOONTsICOONT*! 
- 206 CONTINUE 

NSaNS«IJK 

00 209 JTsl,NS 

A(JT)sAA(JT) 

209 B(JT)38eCJT) 
IZi CONTINUE 

RETURN 
END 

SUBROUTINE SUMDATA < A , B, C, AHEAN, BMEAN, CMEAN, ASD, B80, CSD, 83A , SS6, SS 
O CRABf RAC,RBC,RSSAB,RSSAC,RSSBC,H) 
ERJC dIHENSIQN A(400),C(400)fB(400) 
™™"AH»« 



R£A6sKSAC=«SBC s ^,0 
DO 2 J s l,M 
AMEAN=AMEAN*AC J) 
BM£ANBBMeAN*B( J) 

2 CMEANsCMEAN*C( J) 
AHEANSAHEAN/AH 
BMEANsBMEAN/AH 
CMEAN9CMEAN/AH 
DO 3 K a l,M 
SSA«SSA^(A(K)»AMEAN)**2 
SS6=SS84C8(K)»BMEAN)**2 
SSC8SSC + (CCK)«CMEAN)**2 
fiSA6sKSAB*A(K)*8(K) 

R3SAB=RSSAa+CACK)tAMEAN)*CB(K)-bM£AN) 
RSAC=RS.AC + A(K} *C CK3 

RS3ACsRSSAC + (A(K)^AMEAN)*(C{K)-CME/iN) 

RS8C=KSbC*BCK)*CCK) 

3 K63aC2Ri>SSC+CbfK)«BM£AN)*(C(K)-CMEAM) 
ASO= SQKTKCSSA/CAM-i ,0)) 
dSOsSQf^Tf-- (SS3/ (AM«1 ,3) ) 

CSOsSvSRTr (SSC/(AM*1 

RA8s(RSA8/4M*AMEAN*BMtAN)/ CA8C/*BSD) 

H8Cs(RSBC/AM»BHEAN*CMEAN)/ C8S0*CS0) 

HAC=(RSAC/AM»AMEAN*CMEANJ/(ASO*CSD) 

RETURN 

END 

FUNCTION PR8F (DA,OB,FS) 
PR8Fsl,0 

lF(OA*oa*FR,EO,0.a)RETURN 

IF{FR,LT,l,0)GO TO 5 

AsDA 

BaOe 

f *FR 

GO TO 10 
5 AsDB 

BsDA 

10 AA=2,0/{9,0AA) 
BBaH.iJ/ C9,IS*B) 

ZsABSC ( (l,0«.BB)*F**0,i55533»l,0 + AA]/SQRT(8B*F**0, 66666 Z+AA)) 
IF(9,LT,«,0)Z»Z*( 1 ,0+0,i36*2**4/B**3) 

PR8F30,5/Ci ,®*Z*C0, 196854+2* (0,1 l5l9a+Z*(0,0BS34a+Z*a,0l9527))))** 

la 

IF(FR,LT,l,0)PR8F»l,0«PRBF 

RETURN 
END 

SUBROUTINE AMI NMAX ( A , M, AMIN, AMAX ) 

DIMENSION A(400} 

AMINSA (1) 

AMAXaA(l) 

DO 2 J » 2,M 

IF CA( J),LT,AMIN)AMINSA( J) 
IF CA( JD ,GT,AMAX) AMAX»A(J) 

2 CONTINUE 
RETURN 
END 

SUBROUTINE PAf< A8 C A A , BB, CC , XPT , YPT, AMIN, AMAX , A A A, NST) 
DIMENSION XPT(33), YPT(33) 
Q -,F(AA,EQ,0)a,i 

FRICRINT 5 

!iB^'OHMAT(SX,*THE COEFFICIENT OF X«SQUAREO 18 ZERO.*) 



RETURN 

AKa("AA)*£ttB*8a*a,a*AAiiiCC)/{a,t!*AA*AA) 

AAAal,0/(a»B*AA) 
YPT (DbYPT C33)aAK + 6,0*AAA 
YPT £2)«YPT (32)sAK+5,5*AAA 
YPT (358YPT C31)aAK«5,a*AAA 
YPT (a)«Y?T C30)aAK*a,9*AAA 
YPTt5>»YPTt29)=AK+a,0*AAA 
YPT (&)fYPT (28)=AK*3,5*AAA 
YPT (7)*YPT (27)aAK*5,a*AAA 
YPT (8)«YPT (2fc)aAK*2,5*AAA 
YPT (9)»YPT (25)aAK+2,a*AAA 
YPT{1B)»YPT (24)aAK+l,5*AAA 
YPTCin«YPT (23)sAK*l ,a*AAA 
YPT112)5YPT (22)aAK*0,83*AAA 
YPT(13)aYPT (21 )aAK*aeb7*AAA 
YPT(ia)aYPT t20)3AK*0,S0*AAA 
YPTCib)cYPT £19)=^K*0,33*AAA 
YPTdbjpyPT 118)=AKt0,17*AAA 
YPT£17)sAK 
DO b L.= Wlb 

FTC SQRT(i4,0*AAA*(YPT(t)-AK)) 

UN =3a-U 

XPT(t)sH»PT 

6 XPTtLN)sH*FT 
XP7Ci7)B H 
ICaJCB0 

00 7 K « I #16 
tKs34«K 

IF(XPT(K),l,T,AMlN)ICaIC*l 

7 IF(XPT(l.K) ,6T,AMAX) JCaJC+l 
IF(IC,CTa0)9,i0 

9 NSTa53»lC 
00 8 M8i,33 
JKLsM+IC 
XPT(M)5!XPT£JKL) 
e YPT(M)=YPT (JKll 
IM IF (JCpGT,0)ll»i2 

11 NSTSN37-JC 

12 CONTlNUfc v 
RETURN 

END 
ENO 



ERIC 



